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Chapter 1 

Introduction 



In 1984 Berry [1] addressed the case of a quantum system undergoing a unitary, cyclic evolution 
under the action of a time-dependent Hamiltonian. The evolution was supposed to be adiabatic, 
i.e., the time scale at which the quantum system evolved was supposed to be much shorter than 
the time scale of the changing Hamiltonian. Until Berry's study it was assumed that, after a cycle, 
the quantum state would acquire a so-called dynamical phase, with no physical meaning. Such 
a phase could be eliminated by redefining the quantum state through a "gauge" transformation 
of the form \ip) — > e la \ip). However, Berry discovered that there was a phase, additional to 
the dynamical phase, that could not be "gauged away" and whose origin was "geometric" or 
topological, i.e., it depended on the path that \ip) traced out in the parameter space defined by 
those parameters to which the Hamiltonian owed its time dependence. Berry's discovery was the 
starting point for a great amount of investigations that brought to light topological aspects of 
both quantum and classical systems. Berry's phase was soon recognized as a special case of more 
general phases that showed up when dealing with topological aspects of different systems. For 
example, the Aharonov-Bohm phase could be understood as a geometric phase. The rotation 
angle acquired by a parallel-transported vector after completing a closed loop in a space-time 
region deformed by the presence of a gravitational field is also a geometric, Berry-like phase. The 
same holds true for the precession of the plane of oscillation of a pendulum, due to the Earth 
rotation (a Foucault pendulum). 

Berry's original formulation was directly applicable to the case of a spin-1/2 system evolving 
under the action of a slowly varying magnetic field that undergoes cyclic changes. Now, a spin- 
1/2 system is a special case of a two-level system. Another instances are two- level atoms and 
polarized light. Also here we can expect to find geometric phases. In fact, the first experimental 
test of Berry's phase was done using polarized, classical light [2]. Pancharatnam [3] anticipated 
Berry's phase when he proposed, in 1956, how to decide whether two polarization states are, or 
are not, "in phase". Pancharatnam's prescription is an operational one, based upon observing 
whether the intensity of the intcrfcrogram formed by two polarized beams has maximal intensity. 
In that case, the two polarized beams are said to be "in phase" . Such a definition is analogous to 
the definition of distant parallelism in differential geometry. Polarized states can be subjected to 
different transformations which could be cyclic or not, adiabatic or not, unitary or not. And in 
all cases Pancharatnam's definition applies. Pancharatnam's phase bore therefore an anticipation 
and - at the same time - a generalization of Berry's phase. Indeed, Berry's assumptions that 
the evolution was cyclic, adiabatic and unitary, showed up to be unnecessary conditions for a 
geometric phase to appear. This was made clear through the contributions of several authors 
that addressed the issue right after Berry published his seminal results [4, 5]. 

Pancharatnam's approach, general as it was when viewed as pregnant of so many concepts 
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related to geometric phases, underlay however two important restrictions. It addressed non- 
orthogonal and at the same time pure, viz totally, polarized states. Here again the assumed 
restrictions turned out to be unnecessary. Indeed, it was recently proposed how to decide whether 
two orthogonal states are in phase or not [6] . Mixed states have also been addressed [7] in relation 
to geometric phases which - under appropriate conditions - can be exhibited as well-defined 
objects underlying the evolution of such states. 

The present Lectures should provide an overview of the Pancharatnam-Berry phase by in- 
troducing it first within Berry's original approach and then through the kinematic approach [8] 
that was advanced by Simon and Mukunda some years after Berry's discovery. The kinematic 
approach brings to the fore the most essential aspects of geometric phases, something that was 
not yet accomplished when Berry first addressed the issue. As has often been the case in physics, 
a fundamental discovery that is made by addressing a particular issue, can show afterwards to 
bear a rather unexpected generality and applicability. Berry's discovery ranks among this kind 
of fundamental advances. 



Chapter 2 

The adiabatic and cyclic case 



2.1 A general approach 

Let us consider a time-dependent Hamiltonian H(t) and a state \i>(t)) evolving under its action: 

ih^ |V) = H{t) |V> . (2.1) 
The eigenvalue equation for H(t) can be solved for each t: 

H(t) \a(t)) = E(t) \a(t)) , (2.2) 

whereby we assume that the eigenvalues E(t) are non-degenerate. The phase of \a(t)) can be 
freely chosen. Let us choose it so, that 

(a(t)\a(t)) = 0. (2.3) 

Then, under the assumption of adiabatic evolution and the system starting in the cigcnstatc 
|a(0)), it can be shown that an approximate solution of Eq.( 2.1) is given by 



Wt)>«exp(-ijT E(s)d8^ \a(t)). (2.4) 

This result was obtained by Fock in 1928 [9], and for half a century it was taken for granted 
that by a phase choice Eq.(2.4) would give an adiabatic solution. However, Fock did not consider 
cyclic Hamiltonians. For a cyclic Hamiltonian whose period is T, we have that H{T) = H(0). 
The question addressed by Berry was whether \ip(T)) = \ip(0)) = |o(0)), up to the "dynamical" 

phase-factor exp ^— i E(s)ds/hj . What Berry found was that \tp(T)) = cxp {—i&tot/K) | ^A(O) } ; 
but that the total, accumulated phase <j> tot was not, in general, the dynamical phase. Indeed, 
Berry found that <& to t contained besides the dynamical, a geometric part: 

<&tot = * S +<S>dyn- (2.5) 

That is, besides the expected dynamical part 

$dyn= [ E(s)ds= [ (a(s)\H(s)\a(s))ds, (2.6) 
Jo Jo 

there was an additional contribution <i> 9 whose geometrical origin we will elucidate in what follows. 
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Let us take the following ansatz as a solution for Eq.(2.1): 



m)) « cxp ( U{t) -if E(s)ds ) \a(t)) , (2.7) 



o 



with 7(0) = 0, so that initially \ip(0)) = |o(0)). Then, 



ih^ty) = (-^t) + E(t))\^+ihe^^(t)- l -^E(s)d s ya(t)) (2.8) 
= H(t)\^) + exp(^r(t)-^J E(s)d s y-j(t)\a(t)) + ih\a(t))] (2.9) 



For Eq.(2.1) to be satisfied, we can take 

7(t) « ift<a(t)|a(t)>, (2.10) 

7(t) m ih ( (a(s)\a(s))ds. (2.11) 
Jo 

Now, because -ff(T) = H(0), and because we have assumed non-degenerate eigenvalues 
(E(T) = E(0)), the cigcnstates \a(T)) and |a(0))can differ from one another only through a 
phase-factor that can be set to one: \a(T)) = \a(0)) . We have thus, because of Eq.( 2.7) and 
\a(T)) = |o(0)) - |V(0)), that 

|V(T)} « exp U 7 (T) - ^ / T 25(*)<k | |V(0)) • (2.12) 



o 



Note that Eq.( 2.7) cannot hold exactly, but only as an approximation. This is because the 
condition we have imposed on 7(f), Eq.(2.11), requires that \a(t)) satisfies the equation 

\a(t)) = (a(t)\a(t))\a(t)), (2.13) 

for the second term in Eq.(2.9) to be zero. Such a condition is generally not satisfied. 

In order to see that j(t) has a geometrical origin we need to make our treatment more specific. 
By so doing, we will also introduce some concepts that will be useful for several purposes. 

2.2 Kerry's approach 



2.2.1 Berry's phase 

The time-dependence of H(t) means that the system under consideration is non-conservative. 
This happens when the system is part of a larger system, i.e., when it evolves under the influence 
of an environment. The configuration of the environment can generally be specified by a set of 
parameters (R\, R2, ■ ■ •)■ For a changing environment the Ri are time-dependent, and so also the 
observables of the system, one of which is the Hamiltonian: H(R(t)) = H (Ri(t) , R 2 (t) , . . .) — 
H(t). The evolution of the quantum system is ruled by the Schrodinger equation, Eq.(2.1), or 
more generally, by the Liouville-von Neumann equation 



1 This Section closely follows Ref. [10] 
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ih^- = [H(R(t)),p(t)] (2.14) 

for the density operator p. For a pure state p(t) — \tp(t)) (ip(t)\. An environmental process is given 
through t — > R(t), that is, through the curve that the parameters R describe in the parameter 
space. To such a process it corresponds a curve described by the state vector \ip(t)) in the Hilbcrt 
space Ti. to which it belongs, as well as a corresponding curve described by p(t) = \ip(t)) {tp(t)\ 
in the "projection space" V(H) to which p belongs. We will assume that for any R, there is an 
orthonormal basis \n;R) such that 

H{R) |n; R) = E n (R) |n; R) . (2.15) 

An environmental process R(t) is called periodic with period T, whenever R(T) = R(0), 
E n (R(T)) = E n (R(0)), and \n; R(T)) (n; R(T)\ = |n; R(0)} (n; R(Q)\. In general, however, the 
\n;R) need not be single- valued: 

\n; R(T)) = e M " \n; R(0)) , (2.16) 

in general. Of course, we can change the eigenbasis according to \n; R) — ► \n; R}' — e tan ^ \n; R). 
This is called a gauge transformation. 

The adiabatic approximations consists in taking as an approximate solution of Eq. (2. 14) the 
following one: 

P (t) = \m) W)\ « l«; m) (»; • (2-17) 

When i?(t) traces back a closed path (R(T) = R(0)) so also does p(t) under the adiabatic 
approximation: \ip(T)) (ip(T)\ w |n; #(T)) (n; i?(T)| = |n; R(0)} (n; i?(0)|. However, the state 
\ip(t)) itself may acquire a phase. That \ip(t)} ~ |n; (n; can only be an approx- 

imation follows from observing that [H (R(t)) , \n; R(t)) (n; R(t)\] = 0, so that only stationary 
states \tp(t)} {tp(t)\ = \n;R(t)) (n;R(t)\ could be solutions of Eq.(2.14). In general, the adiabatic 
assumption is incompatible with the requirement imposed by Eq.(2.14) for the evolution of the 
physical states. Let us see under which conditions holds the adiabatic approximation. In general, 
we can write \i>(t)) — ^Z k Ck{t)\k; R(T)) . If |-0(O)) = \n;R(0)), the adiabatic approximation 
means that \il>(t)) w c n (t) \n;R(t)), with c„(0) = l.By replacing such a \tp(t)) in the Schrodingcr 
equation one easily obtains the necessary and sufficient condition for the adiabatic assumption 
to hold true: 

(fc;i2(T)|^|n;i2(t))«0, for all k + n. (2.18) 
By using Eq.(2.15) this condition can be written as 

MggMNjWU o, for alU ^ n. (2.19) 

Hence, the transition frequencies of the evolving system, (E n (R) — E k (R))/h, set the time 
scale for the variation of H~(t) to be considered as "adiabatic". 

Now, the coefficient c n (t) in the adiabatic approximation \ip(t)) w c n (t) |n; R(T)) must satisfy 
the equation 



den _ _ 
dt ~ ° n 



iE n (t) + (n;R(T)\j t \n;R(t)) 



(2.20) 



whose solution is 
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c n (t) = cxp 
Hence, 



-i / E n (s)ds 
Jo 



cxp 



* d 
(k;R(s)\ — \n;R(s))ds 



= Cxp (-i®dyn(t)) exp {ij n {t)) ■ 

(2.21) 



\ip(t)) w c n (t) \n; R(t)) = cxp (i$ dyn (t)) exp |n; R(t)) . 

The geometric phase 7„ can also be written as 



(2.22) 



(n;i?(s)| — |n;i?(s))ds 
ds 



ln{t) = i \ 

Jo 

f R(t) q f R(t) 

= % (n;R\ — \n;R)dR k = AW • dR, 



(2.23) 
(2.24) 



with the vector potential A^™' = i (n; i?| V |n; i?), which is named the Mead-Berry vector potential. 
Eq.(2.24) makes it clear that -f n depends on the path defining the environmental process, viz., 
the path connecting the points R(0) and R(t) in parameter space. This gives us a clue as 
to the geometrical nature of 7„. Now, one can straightforwardly prove that under a gauge 
transformation |n; R) — ► \n; R)' = e M "^ fl ^ \n; R) the vector potential transforms as 



AW — ► A'W = aW - Va n (R). 



(2.25) 



As a consequence then, 



7n(t) — » -/„(*) - 7n(*) - K («(*)) - «.,(#))] 



(2.26) 

and we can use this gauge freedom to remove the phase factor exp(i7„) in Eq.(2.22). Indeed, 
we can repeat the calculations leading to Eq.(2.22) but using \n; R)' = e la ^ |n; R) instead of 
\n;R). We thus obtain an equation like Eq.(2.22) but with primed quantities. We can then 
choose a n (R(t)) = — j' n (t) (modulo 2ir) and so obtain 

|VW) w exp (i$ dvn (t)) |n; i?(t)) . (2.27) 

This is what Fock made, exploiting the freedom one has to choose a n (R(t)). However, when we 
have a closed path C we cannot do this anymore. This is because R{T) = R(0) and exp (ia n (T)) = 
exp (ia„(0)), so that a n (T) — a n (0) modulo 2ir. Hence, because of Eq.(2.26), 



7„(T) — » y„(T) = 7n (T) - 27rm, 



(2.28) 



with m an integer. We conclude that "f n {T) is invariant, modulo 27r, under gauge transformations, 
and we have finally, 



with 



|V(T)) = exp (-i$ d!/n (T)) exp {i ln {T)) |^(0)) 

$d y ™(T) = / E n {t)dt, 
Jo 

ln{T) = j> A (n) ■ dR. 



(2.29) 

(2.30) 
(2.31) 



2.2. 3 BERRY'S APPROACH 



11 



The vector potential A^™-* behaves very much like an electromagnetic potential. The phase 
factors exp(ia„(i?)) form the group U(l), hence the name "gauge transformations" given to the 
transformations \n; R) — > \n; R)' = e 1 "™^ \n;R). As in electromagnetism, we can also here 
introduce a field tensor F^ n ^ whose components are 

f(") — d An) 9 ( n ) . . 

Geometrically, has the meaning of a "curvature" . In differential geometry, where the lan- 
guage of differential forms is used, to A ( n > it corresponds a one-form, while to ¥^ n ' it correspond 
a two- form. 

When the parameter space is three-dimensional, Eq.(2.32) can be written as 



F ( "'=VxA w . (2.33) 



Eq.(2.31) can then be written as 



7 „(T)= / F^-dS, (2.34) 
Js 

with the surface element dS directed normally to the surface S whose boundary is the curve C. 

As we have seen, Berry's phase 7«(r) has been introduced by considering unitary, cyclic 
and adiabatic evolutions. This was Berry's original approach. Soon afterwards his approach 
was generalized by Aharonov and Anandan [4] to the non-adiabatic case, and by Samuel and 
Bhandari [5] to the noncyclic case. Unitarity was also shown to be non-essential by addressing 
the issue from a purely kinematic point of view. This was done by Mukunda and Simon [8], 
whose approach we present in the next Section. Before that, it will be useful to discuss in more 
detail some topological aspects of Berry's phase. They will illustrate how deep can go the notion 
of a geometric phase. 

2.2.2 Angular momentum and monopoles 



A paradigmatic case in which Berry's phase has been amply studied is that of a quantum spin- 
1/2 particle subjected to the action of a rotating magnetic field. We can write such a field as 
B(t) = Bn(t), with a rotating unit vector n(i). The parameter space of this system is thus S 2 , 
the unit sphere in M 3 . The Hamiltonian of this system is 

H(R(t)) = -B-^n(t) • S, (2.35) 
Zmc 

where e is the electron's charge, g the Lande factor, and S =ffa /2, with ~a representing a vector 
of Pauli matrices. For particles of arbitrary spin the above Hamiltonian reads 

H(n(t)) = -B^-n{t) ■ J =bn(t) ■ J. (2.36) 

When B(t) rotates uniformly with angular velocity lo around the z-axis its component B 3 
remains constant. Let us write B(t) = £?(cos uitsin9, sin cutsin9, cos 9). The eigenvectors of the 
corresponding Hamiltonian can be shown to satisfy 



with 



H(n(t))\k;n(t)} = bn(t) ■ 3\k; n(i)) = bk\k; n(i)), 



(2.37) 
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J 3 |ft;n(0)) = fc|ft;n(0)). (2.38) 

This immediately follows from noting that H(n(t)) and J 3 are related to one another by a uni- 
tary transformation: H(n(t)) = bU(9,ojt)J 3 U^ 1 (9,ujt), with U(6,wt) = cxp (— iuot J 3 ) exp (— iO J 2 )- 
We note that the eigenvalues of H(R) are constant (E(R(t)) = bk), although its eigenvectors are 
not: |fc;n(i)) = U(6, ut)\k; n(0)). 

If the system has spherical symmetry, and hence H(R) commutes with J 2 , then the \k; R) 
are also eigenvectors of J 2 for eigenvalues that can be denoted as j(j + 1). In such a case, the 
values of k are given by k = + — and it can be shown that j must be an integer 

or a half-integer, so that k — 0, ±1/2, ±1, ±3/2, But there are cases in which H(R) does 

not commute with J 2 , as it occurs, e.g., when the system has only axial symmetry (a diatomic 
molecule would be an important example of this case). Nevertheless, also in such a situation 
one can derive from considerations based on Berry's phase that k — 0, ±1/2, ±1, ±3/2, . . ., as 
we shall sec. This nicely illustrates how Berry's approach can bring out rather unexpectedly the 
topological root of some known results. In the present case these topological properties are also 
connected with Dirac's monopoles. 

The topology we are referring to in the present case is the topology of the unit sphere S 2 . 
As well known, it is not possible to assign coordinates to all points of S 2 by a single patch. 
We need at least two of them. To be concrete, let us use spherical coordinates to parametrize 
the points of S 2 : then, all of S 2 , with exception of two points, is univocally given by n(#, ip) — 
(cos ip sin 9, sin ip sin 9,cos0), with < 9 < tt and < p < 2ir. The exceptional points are the 
north pole e 3 = (0, 0, 1) and the south pole — e 3 , for which the value of p is arbitrary. 

Let us see the consequences that the topology of S 2 has for Berry's phase. To this end, let us 
consider the following Hamiltonian, a particular case of which was the Hamiltonian of Eq.(2.36). 
Let us define H(n(9,p)) — bn(9 7 ip) ■ J. Its eigenvectors can be obtained from those of J3 by a 
unitary transformation. Indeed, let us write J 3 |fc;e 3 ) = fc|fc;e 3 ) and denote by |fc;n(0, ip)) the 
eigenvectors of n(9, <p) ■ J. Last ones can be obtained from the |ft;e 3 ) by applying to them a 
rotation that brings e 3 to n(9,<p): lZ(9,p)e 3 = n(9,ip). A convenient choice for lZ(9,ip) is given 

by 

U(0, p) = K 3 (v)n 2 (9)K 3 (-<p). (2.39) 

72.3 (y) 72-2(0) would also bring e 3 to n(9,ip); but the last factor 72 3 (— (p) in Eq.(2.39) serves to 
make 72(0, p) — I. The unitary transformation corresponding to 72(0, p) is 

Un(9, p) — exp (—ipj 3 ) cxp {—i9J2) exp {ipJz) . (2.40) 

Thus, 

\k;n{e,ip)) N = \k ] e,<p) N = U N {9,ip)\k;e 3 ). (2.41) 

Our choice of Un(9,p>) smoothly assigns to each n(6>, <p) a unique eigenvector \k;9,(p), with 
the exception of the south pole (hence the label N), for which 9 = tt. In such a case, using that 
exp (— in J2) exp (i<pJs) = cxp (— z<pJ 3 ) exp (— in J 2) , Eq.(2.41) gives 



\k;n, p) N = exp (—i<p J3) exp (—in J2) exp (iip J 3 ) |fc;e 3 ) = exp (—in J 2 ) exp (2i<p J 3 ) |fc;e 3 ) 

= cxp(-inJ 2 )e 2lklfi \k;e 3 ) (2.42) 

Thus, for the different values of ip in the interval < p> < 2ir we obtain different state vectors 
I ft; 7r, ip), so that we have no single- valued assignment at the south pole for this parametrization 
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of S 2 . At the north pole the assignment is single- valued, because 1Z(0, <p) — I, as we have seen. 
We thus have \k;0,<p) = |fc; 0, 0> = |fc;e 3 ). 

A parametrization that is single-valued at the south pole can be obtained by gauge trans- 
forming the \k; 9, ip) according to 



\k;8, tp)s = e~ l2kv \k;9,ip) N = exp(-i(pJ 3 )cxp(-i9J 2 )cxp(-i(pJ 3 ) |fc;e 3 ) = U s (9,(p)\k;e 3 ). 

(2.43) 

At the south pole we obtain, similarly to what we did before, that \k;w,tp)s = e~ tvj2 \k;e 3 ), 
so that we have now a single- value assignment at this point. However, at the north pole we have 
now the multiple-valued assignment |fe;0, <p)jv = e~ l2,rv |fc; e 3 ). In order to have a single-valued 
assignment in all of S 2 we need at least two patches: On = S 2 — {— e 3 } and Os = S 2 — {e 3 }. In the 
overlap OpjHOs either parametrization can be used. Corresponding to these two paramctrizations 
we have two vector potentials: A„ = i m (k; R\ V^|fc; R) m , m = N,S. For On the eigenvectors 
are given by \k;9, (p)n — Un(9, <p)\k;e 3 ) , while for Os they are \k;6,<p)s = Us(9,ip)\k;e 3 ). We 
can then calculate 

A® = * <*;e 8 | Ul (e r l + *e\l e+ e v ^)uL \k; e 3 ) . (2.44) 

We obtain 



< - k{COS9 : 1) e v , e^ir, (2.45) 

Affi = ^^e„ ^0. (2.46) 
s r sin 6 

We see that and are related to each other by a gauge transformation: 

Ag! = Ag> + ^- 9 e v = A%1 Va, with a = -2k V . (2.47) 
The corresponding field tensor F( fc ) = V x A^ fe ' is given by 

F< fc ) = -^e r . (2.48) 

We recognize here the field produced by a monopole located at the origin of our coordinate 
system. In terms of this field we can now calculate the geometric phase corresponding to a closed 
curve C. It is given by 



Ik 



(C)= f F^-dS= f F*r 2 sm9d9dip = -k f sin9d9d<p=-k f dSl, (2.49) 

J S J S J S J s 

with S being any surface whose boundary OS — C, and dfl the element of solid angle, or area 
element of the unit sphere. We have thus the important result 

lk (C) - -kfl(C). (2.50) 

7fe(C) is gauge-independent, as F^ fe ^ is. Our calculation is therefore independent of what 
surface S we choose. For example, S could be contained in On, in which case its normal points 
out of the sphere S 2 . If instead S C Os, its normal points into the sphere S 2 . Let us denote 
the first surface by Sn and the second one by Ss- They have as common boundary C and their 
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union is the whole sphere S 2 . Hence, because of the opposite pointing directions of the normals 
to Sn and to Ss, we have 

7fe(C) = k f dn = -m{C) = k ( f dn- ( dn)=k(4n- n(c)) + 2™, 

n = ±0,±l,±2,..., (2.51) 

where we have included the term 2im to take into account that 7fc(C) is defined only modulo 2it. 
From Eq.(2.51) we see that —kfl(C) = Auk — kQ(C) + 2mr, n = ±0, ±1, ±2, . . ., and so we arrive 
at the conclusion that k = 0, ±1/2, ±1, ±3/2, . . . without having assumed spherical symmetry: J 2 
may not commute with H(R). This illustrates how some results can have in fact topological roots. 
In the present case, it is the topology of S 2 that shows up as a common root to the geometric 
phase and to monopoles. In both cases the topology of the parameter space shows its non-trivial 
structure. In the case of the monopole we can say that it represents a singularity which makes a 
space region multiple-connected. In the case of the sphere, its topology precludes a covering by 
a single patch. In both cases, the corresponding vector potentials A are non-integrable. There is 
no single- valued function a(R) for which A = Va. 



Chapter 3 

The kinematic approach 



3.1 Total, geometric, and dynamical phases 

We will see now how all of the essential features we have seen before emerge from a purely 
kinematic approach. Let us start by defining Ho as the subset of the Hilbert space H whose 
elements arc normalized nonzero vectors \ip). A curve Co in Ho is defined through vectors \ip(s)) 
that are continuous functions of some parameter s G [si,S2]- Because \ip(s)) is normalized, 

(ip(s)\ip(s)) + (ip(s)\ip(s)) = 0, which means that Re(tp(s)\ip(s)} — 0, so that 

(V(s)|V>(s)) = i/m(V(s)|V>(s)). (3.1) 

Under the gauge transformation IV'(s)) — > \ip'(s)) — exp(ia(s)) \tp(s)) the curve Co — > C and 
Im(ip(s)\ip(s)) — ► Im(tjj' (s)\ip' (s)) — Im(^p(s)\ip(s)) + a(s). This suggests that we define the 
following gauge-invariant quantity, as a functional of the curve Co, and hence of C : 

<& ff (Co) = arg^si) |V(* 2 )) - Im f" \i>{s))ds. (3.2) 

For such a quantity holds true not only that it is gauge invariant, i.e., < f ) 3 (Co) = <& S (C ), 
but it is also invariant under changes of the parameter s. That is, <I> g remains the same if we 
replace s by another parameter s' which is a monotonically increasing function of s. Gauge and 
reparametrization invariance mean that in spite of having defined $ g in terms of \ip(s)) and the 
curve Co, it is in fact a quantity that depends on some equivalent classes of |VK S )) an d Co- Indeed, 
the set {|V>') = ex P {ioi) \^)} whose representative element is |V>), constitutes an equivalence class. 
The space spanned by such equivalence classes is called the "ray space" IZq. Instead of working 
with equivalence classes we can work with projectors: \ip) whereby each projector corresponds 
to an equivalence class. That is, the whole set of vectors {1^') = exp (ia) projects onto the 
object 1^) (V'l by means of a projection map tt : Ho — > Tlo- In particular, the curves Co, C' 
which are connected to one another by a gauge transformation, constitute also an equivalence 
class. Under tt, they project onto a curve C C 1Z - What we have seen above is that <I> g is in fact 
a functional not of C , or C' , but of C , which can be seen as the curve defined by \ip(s)) {ip(s)\. 
We call then <& g the "geometric phase" associated with the curve Co C IZo- We should then 
better write <& fl (Co), though the actual calculation of such a quantity requires that we choose 
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what is called a "lift" of Co; that is, any curve Co in the equivalence class that projects onto Co- 
tt(Co) - C . 

& g (C ) has been defined as a sum of two terms in Eq.(3.2). Each of these terms deserves a 
separate definition: 

$>tot{Co) = arg(V(si)|V(s 2 )), (3.3) 
QdyniCo) = Im f 2 {ij;{s)\^{ S ))ds. (3.4) 

J 81 

$ tot (Co) is the ioiaZ p/iase of Co, also known as the Pancharatnam phase. It is thus defined 
as the argument of the complex number (tp(si)\i(>(s2)} = \{ip(si)\ip(s2)}\ e la . Later on, we will 
discuss the physical meaning of this phase in the context of polarized states, the case addressed 
by Pancharatnam. $dyn{Co) is the dynamical phase of Co- We see thus that even though both 
&tot(Co) and &dyn(Co) are functional of Co their difference, & g , is a functional of Co: 

$ S (C ) = * tot (Co) - $dyn(C )- (3.5) 

Let us stress that the present definition of the geometric phase does not rest on the assump- 
tions originally made by Berry. <& s (Co) has been introduced in terms of an evolution of state 
vectors |^>(s)) that does not need to be unitary nor adiabatic. Furthermore, the path Co does 
not need to be closed and, hence, no cyclic property needs to be invoked. Now, for a given Co 
in ray space, we can choose different lifts in order to calculate <& ff (Co). We can exploit this free- 
dom to express <!> 9 (Co) as it is most convenient to our tasks. For example, we can always make 
^tot(Co) = 0, by properly choosing the phase of, say, \ip(s2)}- In that case, <I>g(Co) = —<&dyn{Co)- 
On the other hand, we can also make &dyn{Co) = by choosing what is called a "horizontal 
lift" . This is a lift for which Im(tp(s^\ip(s)) = 0. Because, as we saw, Re(ip(s)\ip(s)) = 0, for 
a horizontal lift it holds that (tp(s)\ip(s)) = and <& ff (Co) = 3>tot(Co)- ^ n or dcr to obtain a 
horizontal lift we need only submit, if necessary, an arbitrarily chosen lift, that is defined by 
\ip(s)}, to a gauge transformation. That is, we change \ip(s)) — > \tp'(s)) — exp(ia(s)) \i(>(s)), as 
a consequence of which Im(ip(s)\ip(s)) — ► Im(ip' (s)\ip' (s)} = Im(ip(s)\ip(s)) + a(s). We then 
require that Im(il>' '(s)\ip' '(s)) = 0. This can be attained by taking 

a{s) = -Imf (ip{s)\ip{s))ds, (3.6) 

J 81 

where we have assumed that a(s\) = 0, i.e., that \ip'(si)) = \ip(si)). 

The remarkable fact that <& g (Co) depends only on ray-space features makes it desirable to 
find an expression of it that involves quantities of such a space. In order to achieve this, let us 
consider the operator K(s) = p(s) = d(\ip(s)) (ip(s)\)/ds. We can see that its action on \ip(s)} 
gives 

K(s)\iP(s)) = |V>( S )) - (yj(sM( S )m S )). (3.7) 

K(s) is invariant under a gauge transformation \ip(s)) — > \ip'(s)) = exp (ia(s)) \ip(s)), i.e., 
K'(s) — K(s); hence, Eq.(3.7) holds also for the primed quantities. Let us now assume that we 
are dealing with a horizontal lift, that is, (tp(s)\tp(s)) = 0. As we have seen, this can always be 
achieved by a proper gauge transformation. For a horizontal lift Eq.(3.7) reduces to 



(3.8) 
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The solution of Eq.(3.8) can be formally given as a Dyson series: \tp(s)) = P (cxp p(s)ds^J \ip(si)}. 
The operator P is the "parameter-ordering" operator: it rearranges a product of parameter- 
labeled operators such that larger parameter values are put to the left, e. g., P (p(si) p{s2) p{ s s)) = 
p(ss)p(s2)p(si), for s 3 > s 2 > s\. Having a horizontal lift, the geometric phase reduces 
to $ 9 (C ) - * to t(C ) = arg<V(si)|V(s2)>. Now, (^(«i)|V(* 2 )> - Tr (|V(« 2 )> and 
IVK^)) = P (° X P J"/* f>{s)ds*j \ip(si)}, so that we can finally write 

$ ff (C ) = arg Trjp (exp jf ' p(a)ds) p(si)} (3.9) 

Eq.(3.9) gives the desired expression of <j> 5 (Co) in terms of ray-space quantities. Co is any smooth 
curve in ray-space. If Co is a closed curve, p(s2) = p{si), and |V'( S 2)) must be equal to |?/>(si)) 
up to a phase factor: |i/>(s 2 )) = e M with a = arg (ip(s2)\ip(si)}. For the horizontal lift we 

are considering, a — arg (^(s2)|V'(si)) = $ s (Co) and we can thus write 

|V>(s 2 )> - P ( cx p/ S2 /K s ) da ) l^( Sl )) = exp(^ s (C )) • (3.10) 

3.2 Geodesies 



Let us introduce now the concept of geodesies in both the Hilbert-space and the ray-space. To 
this end, let us refer again to Eq.(3.7). The action of K(s) on \tp(s)) produces a vector which 
is orthogonal to \i(>(s)), that is, (ip(s)\ K(s) \ip(s)) = 0. In general, (ip(s)\ip(s)} ^ 0; that is, 
the vector \ip(s)} tangent to the curve C = {\ip(s)}}need not be orthogonal to it. What K(s) 
does is to produce, by acting on |?/>(s)), the component of |V>(s)} that is tangent to the curve Co 
. Such a component is \ip(s)} less its projection on \ip(s)}, i.e., \ip(s)} — (^p(s)\tp(s))\'ilj(s)) . Let 
us denote this component by \ip(s))± = K(s)\ip(s)). Under a gauge transformation \ip(s)) — > 
\ip'(s)) = cxp(ia(s)) IV'(s)) we have that K'(s) — K(s), and hence it follows that \ip'(s)}± = 
exp(ia(s)) \ip(s))j_. Thus, the modulus of \ip(s))j_ is gauge invariant and thus constitutes an 
appropriate quantity to be used for a definition of extremal curves. The extremal property is 
defined with respect to an appropriately defined "length" of the curve. But as we want such 
a definition to be also parameter invariant, we need the length to be defined in terms of an 
integrand that is homogeneous of first degree in \ip(s)}. This can be achieved if we consider the 
square root of the modulus and define the following functional of Co as the length of this curve: 

£(c ) = f 2 yW( s M s )Kd s . (3.ii) 

J si 



Geodesies are defined as those curves for which £(C ) is extremal. By applying the tools of 
variational calculus it is straightforward to obtain the following differential equation: 

; ^ (S)K = MM*)), (3-12) 



ds 



/lMs)Ms)>± 



with f(s) an arbitrary real function. Although Eq.(3.12) involves quantities that depend on 
some lifted curve Co it must be gauge and reparamctrization invariant, because it follows from 
a quantity, Eq.(3.11), possessing such properties. We may therefore choose a different lift and 
parametrization to transform Eq.(3.12) according to our conveniences. We can hence choose a 
horizontal lift: (ip(s)\ip(s)) = 0, which implies that \ip(s))± = |V>(s)) and choose the parameter s 
such that (ip(s)\ip(s)) is constant along Cq. This reduces Eq.(3.12) to 
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£*M8)) = JWsMWMMs)) (3.13) 



Now, deriving twice (ip(s)\i/j(s)) — 1 we obtain \J (tp(s)\ip(s))f(s) + (ip(s)\ip(s)) = 0, which fixes 
f(s) to 

f(s) = -y/(i>( S )Ms)), (3-14) 

and Eq.(3.13) reads finally 

^(s)) = -u 2 \^(s)), (3.15) 

with u 2 = (^>(0)|V>(0)). This is the equation that geodesies must satisfy, whenever they are taken 
as a horizontal lift from the corresponding geodesic Co in ray space and with the parameter s being 
chosen so that (tp(s)\tp(s)) is constant along the geodesic (this is called an "affine" parameter). 
Eq.(3.15) can be easily solved for the initial conditions |^>(0)) = \(pi) and |^>(0) ) = u\(p2), thus 
satisfying = 1, (</>i|</>2) = 0, and {^ 2 |02> = 1: 

|V(s)) = cos (us) |0i) + sin (us) (3.16) 

An important consequence of this equation is that (-0(0) |-0( s )) = (<f>i\i>(s)) = cos(ujs). Now, we 
can always choose a parametrization such that cos (us) > 0, for s e [si, s 2 ]- Assuming that such 
a choice is made, arg(i/>(0)|'i/'(s)) = 0. But because our lift is a horizontal one, a,rg(tp(0)\tp(s)) = 
$3 (C ), so that 



$ 9 (C ) = for a geodesic C . (3.17) 

From Eq.(3.16) we see that our geodesies are arcs of circles in the space whose orthonormal 
basis is {\<fii) , | ezJ> 2 ) } - We are thus dealing with a two-level system. A geodesic \ip(s)) such as the 
one defined in Eq.(3.16) projects onto a geodesic in ray-space p(s) — \ip(s))(tp(s)\. Last one can 
be mapped onto the unit sphere in a well-known manner. Indeed, for a two-level system p(s) has 
the form 

p{s) = \{I+ n(s)-?), (3.18) 

with ~o being the triple of Pauli matrices, / the identity matrix, and ~n a unit vector given by 
~n = Tr(p ■ It). 

It is therefore clear that two given (unit) vectors, and ^2), can always be connected by 
a geodesic. Indeed, we need only note that to the two vectors \tpi) and \if>2) there correspond two 
unit vectors ~rt\ and ~n 2 lying on the unit sphere. There is always a great circle passing through 
two given points on the sphere. This is the geodesic arc joining p x and p 2 that can be lifted to 
a geodesic arc in Hilbert-space joining \tpi) and | -i/>2 } - If necessary, once we have this geodesic we 
can submit it to a gauge transformation, thereby generally destroying its horizontal but not its 
geodesic property. 

Let us show this procedure in more detail. Consider two vectors: \ipi) and |V4)- We take any 

orthonormal basis |0 2 )} with \^) = |^) (and = (|^> - |^i)^i|V 2 )) (l - l(0i|^2>| 2 )" 

though this explicit form will not be required for our present purposes) . In such a basis, the gen- 
eral form of ^2); U P to a global phase factor, is \1Jj2) = cos(6'/2)|0i) + e lv sm(8/2)\<p2), that 
is, \ip' 2 ) — e ta \ip2)- We start by considering first the case in which the final vector is given by 



3.2. GEODESICS 



19 



\ip2), and then we deal with the more general case of |^> 2 ) = s la \ip2}- The corresponding pro- 
jectors p\ = \ipi)(tpi\ and p 2 — 1^2} (V^l are given by expressions of the form of Eq.(3.18) with 
Tii = (0,0, 1) and it 2 = (cos<£sin#,sin</?sin#,cos#). That means that «i is the North pole (of 
what is called the "Bloch sphere") and "n 2 is a point whose spherical coordinates are (6,tp). In 
order to bring n 1 to n 2 along a great circle we can submit it to a rotation whose axis is given 
by a unit vector it parallel to Tti x it 2 , the rotation angle being equal to 0, the angle between 
"ni and it 2- The unit vector it is thus given by 

n = ^-r— . (3.19) 

To the rotation bringing "rii to it 2 it corresponds a SU{2) transformation bringing |i/>i) to |V>2), 
viz, C/(0, ^j) = ^2), which is given by 



U{0, p) = exp • 7?) = cos J-i sin (0 7?-7? = cos (£) l-i ^-^L .?. (3.20) 

By defining |^>(s)) = £/(0s, <^)|<^i) we have that, first, |^(0)) = \ipi), (V'(l)) = I ^2) and, second, 
that the curve defined by \i>(s)}, s £ [0, 1], is a horizontal geodesic. Indeed, by explicitly writing 
U(0s, ip) as 

U(9s, <p) = cos (^^j I — ism (^ s ^j ~™ ' <p ' ~® (3-21) 

with ~n v = (— sin </?, cos </?, 0) it is straightforward to verify that \ip(s)) fulfills the defining prop- 
erties of horizontal geodesies: 



|^|V>(s)) = -^(sMs)Ms)> = -^|V>(s)>, (3-22) 
W>{8)\i>{8)) = 0. (3.23) 

Hence, for |^i) = \(j)i} and \ip2) = cos(0/2)|(/>i) + e ttp sin(#/2)|0 2 ) we have proved that there is 
a horizontal geodesic joining them, which is given by \ip(s)) = U(9s, <£>)|</>i), with U(0s, <p) as in 
Eq.(3.21). Next, let us consider a general final vector \i/j' 2 ) = e la \ip2)- In this case we need only 
change U(0s,ip) by e~ las U{0s 1 ip) and the curve \ip'(s)) = e- ias U(9s, ip)\<j)i), with ^'(0)} = \ipi), 
= \ip' 2 ), will still be a geodesic; that is, it satisfies Eq.(3.12) (with f(s) = 9/2) though it 
is no longer a horizontal one ((ip' (s)\ip' (s)) = —ia). 

In summary, we have proved that any two vectors, \tpi) and |^>2)> can be connected by a 
geodesic Co- If this geodesic happens to be a horizontal one, then its dynamical phase vanishes 
and so does its total phase arg^i IV^), as follows from Eq.(3.16). Hence, the geometric phase 
^s(Co) = 0. This last property is gauge independent and so holds true for any lift. However, 
if the geodesic Co is not horizontal, then &dyn(Co) 7^ and so also arg(V>i ^2) 7^ 0, though 
$ S (C ) - 0. 

We note that irrespective of the dimensionality of the Hilbert space to which \ip\) and ^2) 
belong, the only tools we have used to show how they can be joined by a geodesic are those of a 
two-level system. This is because there were only two vectors involved and we could thus restrict 
ourselves to work on a two-dimensional subspace of the whole Hilbert space. Now, the fact that 
any two unit vectors can always be connected by a geodesic leads to an alternative formulation 
of the geometric phase. It rests upon the concept of Bargmann invariants, that we shall present 
next. Eq.(3.17) will play a central role, together with quantities like arg^ilV^)- When the total 
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phase arg(^i|^2) = we say that \ipi) and ^2} are "in phase". As we have seen, this is the 
case when these vectors can be joined by a horizontal geodesic. Consider a third vector \tp 3 ) that 
is joined to ^2) by a horizontal geodesic and hence arg^lV'a) — as well. Our three vectors 
are thus joined by a curve conformed by two arcs, each one being a horizontal geodesic. Can we 
conclude that the total phase between |-0 3 ) and \ipi) vanishes too? The answer is generally on the 
negative. Being "in phase" is not a transitive property. The following discussion will illustrate 
this point. 



3.3 Bargmann invariants 

Let us take N points in ray-space: pi, pi, ■ ■ ■ , Pn- As we have just seen, each pair of these 
points can be connected by a geodesic arc. Let us denote by Co the curve formed by the N — 1 
geodesic arcs joining the N points. Let us assume that any two successive pairs of the points 
are non-orthogonal. That is, choosing any lift of them, , IV'2) , ■ • ■ , H'n), all inner products 
(ipi\ipi + i) ^ 0, for i = 1, . . . , N — 1. Let us calculate <I> 3 (Co). It is given by 

N-l 

$ 5 (C ) = <S> tot (C ) - $ dyn (Co) = arg (^ N ) - ]T (3.24) 

fe=i 

where < &^'^ +1 ' is the dynamical phase for the geodesic joining \ipk) with \ipk+i)- But, because 
<4 fc < fc+1) = 0, we can write d>^ +1) = = arg<^ +1 >. Now, E^ 1 arg(^ fc+1 ) 

argnf=i 1 (V'fe|V'fe+i), and arg(^i|V>jv) = - arg^l^i), so that 

N-l /N-l \ 

$ 9 (C ) = arg(^i|^jv> - arg (ipk\ipk+i) = -arg I ]J (V'felV'fe+i) I (V'jvIV'i) (3-25) 

fc=i \fe=i / 

and we can write 



$ 9 (C ) = -arg(Vi|V2)(V2|V'3)...^iv|Vi)- (3-26) 

Although this expression for <E> g (Co) has been derived by joining \ipi), ■ ■ ■ , IV'jv) with a series 
of geodesic arcs that all together make up Co, the final expression contains nothing but the points 
IV'i), • • • , \iPn) themselves. Such a result brings strikingly to the fore the topological character 
of the geometric phase. Quantities like |"02) ("02 1"03) ("03 1 ^1} are called "Bargmann invariants". 
They generalize KV'ilV^)! 2 , which is invariant under simultaneous U(l) (gauge) transformations 
of its constituent parties: |?/>i) — > = exp(iai) \ipi), \1jj2} —> = exp(ia2) IV^)- Quantities 
that are invariant under U (1)®U (l)(g). . . were introduced by Bargmann for studying the difference 
between unitary and anti-unitary transformations [11]. 

The curve Co in Eq.(3.25) was assumed to be open, as pn =/= Pi- However, we can have a 
closed curve Co by completing the TV — 1-sided polygon Co with a geodesic arc that connects 
p N with pi. By repeating the above steps that led us to Eq.(3.25), though taking into account 
that now <& tot (C ) = because the final point tpN+i = V'li we see that <J> 5 (Co) = — &dyn(Co) = 
— a^gYl k=1 (ipk\ipk+i} , so that <& 9 (Co) is given by just the same expression as in Eq.(3.26). In 
other words, <J> g (C ) = ^(Co): both C and C have the same geometric phase. 

Starting from Eq.(3.26) it is possible to recover the results previously found for general open 
curves [8] . One proceeds by approximating a given curve by a polygonal arc made up of N — > 00 
geodesic arcs. By a limiting procedure one recovers then <S> g (Co) = <&tot{Co) — &dyn(Co) with 
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^tot(Co) and &dyn(Co) given by Eqs.(3.3) and (3.4), respectively. Also Eq.(3.9) can be recovered 
in a similar fashion [8]. 

Coming back to Eq.(3.26), let us denote the iV-vertex invariant of the right-hand side as 

AW(Vi, . . . ,iM = MM (Ml*) ■ ■ ■ (Mi/fi). (3-27) 

We can prove by induction that it can be expressed in terms of the (fundamental) three-vertex 
invariant as 

N-l 

argAW(Vi,...,Viv)= ^ arg A( 3 )(0 1; fe ,0 fc+1 ). (3.28) 

k=2 

This result is suggested by considering the TV-vertex invariant as represented by an TV-sided 
polygon and then subdividing this polygon into N — 2 triangles. For example, for N = 4 such a 
triangulation which adds a line connecting the vertices 1 and 3 - suggests that we insert in the 
expression for A( 4 )(0i, . . . , tpi) the quantity (03|V>i}(V'i|V'3) which does not alter its argument 
because it is real and positive: 



argA (4) (0i,...,V> 4 ) = arg((V^2>(^2|V'3>(V'3|V'4>(V'4|V'i> x (03|V'i)(V'i|03)) 
= arg <-0i | -02> (-02 1 -03 } ("03 1 -01 > + arg(^ 1 |V'3)(^3|V'4}(V'4|V ; i) 
= A^i^V^+A^i,^,^). 

It is then clear how this result generalizes to Eq.(3.28). The triangulations leading to the additive 
property expressed by Eq.(3.28) suggest also that arg A^ has the nature of an "area". It can 
be shown, indeed, that <fr g can generally be expressed as a (two-dimensional) surface integral of 
a two- form [8] . 

Let us consider the concrete case of a two-level system and calculate A^ 3 '(tpi, ip2, 1P3). We can 
write this Bargmann invariant in the form 



A< 3 >(V>i,V2,$0 = (HH(H^3)(Hi>i) = (WtoPaWi) (3-29) 
= Tr[\1> 1 )y> 1 \p2Ps]=Tr\p 1 p 2 p s ]. (3.30) 

For two-level systems, pi = (I + ~n j • 7?) /2, i = 1,2,3. A straightforward calculation gives 

Tr [P1P2P3] = * [1 + "n 1 • ~rt 2 + ~rti • ~rtz + ~n 3 ■ ~n 1 + irti ■ (r? 2 x "n 3 )] . (3.31) 

Now it is easy to obtain the phase \ °f A( 3 )(0>i, -02, "03); as we have the real and imaginary 
parts of this last quantity. Denoting by 9jk the angle between n j and n k, i.e., cos 6 jk = n j ■ n k, 
we obtain from Eq.(3.31) that 

= cos 2 (gig/2) + cos 2 (0 23 /2) + cos 2 (ggi/2) - 1 
C ° &X 2cos(0 12 /2)cos(0 23 /2)cos(# 3 i/2) ' 1 ' ' 

which is a well-known expression for the cosine of half the solid angle of the spherical triangle 
whose vertices are ~n\, ~r? 2 , f?3- This means that x 1S j ust half the solid angle subtended by 
the three points representing the states \ipi), \ip2), 103) on the (Bloch) sphere. This is a central 
result in the theory of Berry phases and constitutes a special case of what we have derived before, 
namely Eq.(2.50) with k = 1/2, as it corresponds for spin-1/2 particles. 
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Chapter 4 

Pancharat nam's phase and its 
measurement by polarimetry and 
interferometry 



We have introduced the total phase between two states as arg(^i|^2)- This constitutes a gen- 
eralization of Pancharatnam's definition for the relative phase between two polarization states 
of light. Pancharatnam [3] addressed the problem of how to decide whether two nonorthogonal 
polarization states could be said to be "in phase" . His proposal was based on the following oper- 
ational prescription. Consider two non-orthogonal polarization states, |z) and |/), with \i) ^ |/) 
and let them interfere with one another. Apply a phase-shift <j> to one of the states. The resulting 
intensity pattern is given by 



Find now <f> for which / is maximum. This occurs for <f> = arg (i\f) = &tot, which is thereby defined 
as the total (or Pancharatnam) phase between |z) and |/). If arg(i|/) = the states are said to 
be in phase. Polarization states are two-level systems. When they are submitted to the action 
of intensity-preserving optical elements, like wave-plates, then the corresponding polarization 
transformations belong to the group t/(2), or to the group SU(2), if global phase factors are 
left aside. We can exhibit $ tot by submitting an initial state |z) to an arbitrary transformation 
U E SU(2) that brings it into a final state |/) = U\i). The intensity measurement for which 
Eq.(4.1) applies can be implemented with, say, a Mach-Zehnder interferometer. Alternatively, 
one could employ polarimctric methods. We will discuss both methods in what follows. But 
before that, it will be useful to say some few words about the different parametrizations used for 
writing the elements of SU(2). 

As we saw before, a parametrization of U <G SU(2), when viewed as realizing a rotation by an 
angle a around an axis n = (sin 9 cos ip, sin 9 sin ip, cos ip), is given by 



In this case U is specified by the three parameters (a, 9, ip). Alternatively, we could specify U by 
so-called Eulcr parameters: 



I = |e<* \i) + \.f) I 2 = 2 + 2 |<t|/>| cos (0 - arg (i\f)) . 



(4.1) 



U(a, 9, <p) — exp i—n-~a 



(4.2) 




(4.3) 
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which we call the Y ZY-form for obvious reasons. It corresponds to three rotations around 
Cartesian axes e y and e z . Another useful form of U, the ZY Z-form, is given by 



v <5 + 7 \ (-t^~^\ \ f e l5 cosf3 -e J7 sin/3 



(4.4) 

The representation given by Eq.(4.4) is particularly useful for exhibiting Pancharatnam's phase. 
Indeed, taking as initial state \i) — \+) z , the cigcnstatc of o z for the eigenvalue +1, and setting 
\f) = U\+) z we obtain 

(<l/> =* (+1 U(p n ,d) \+) z = e^cos/3. (4.5) 

Thus, §tot = ar g (i\f) = 5 + arg(cos (3), for (3 ^ (2n + l)n/2. Because cos @ can take on positive 
and negative real values, arg(cos/3) equals or 7r, and <$>tot is thus defined modulo tt. The 
parametrization of Eq.(4.4) gives therefore <& tot = S (modulo tt). On the other hand, for the 
optical implementation of U another parametrization is more appropriate. It is well known that 
we can implement any U G SU(2) with the help of three retarders, two quarter-wave plates and 
one half- wave plate [19, 20]. In polarization-space of, e.g., horizontally and vertically polarized 
states of light, {\H) , \ V}}, U is best represented in the form given by Eq.(4.3), the YZY-form, 
because of the following relationship involving the Eulcr angles a\, a 2 , a^: 



exp (-i(a 3 + 37r/4)cr y ) cxp (i(ai - 2a 2 + a 3 )a z ) exp (i(ai - tt/A)<t v ) = Q(a 3 )H(a 2 )Q(ai). 

(4.6) 

Here, Q means a quarter-wave plate and H a half-wave plate. The arguments of these retarders 
are the angles of their major axes to the vertical direction. In the case of the U given by Eq.(4.3), 
by applying Eq.(4.6) we obtain 



Having seen the different parametrizations which are relevant for our purposes, let us turn now 
to discuss the implementation of some experimental arrangements that exhibit Pancharatnam's 
phase. 

4.1 Interferometric arrangement 

In general, with an interferometric array Pancharatnam's phase can be obtained from intensity 
measurements that are described by Eq.(4.1). Introducing U from Eq.(4.3) into Eq.(4.1) we 
obtain 



-^(e i *\+) z + U(Z,r ] ,0\+) z )\ = (4i 



= 1 - cos (I) cos (^— ^— j cos {<!>) - sin (|) c °s [~^~ J sin ^ ' 

From Eqs. (4.3,4.4) we derive the following relationship for the parameters of these two represen- 
tations: 
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to " (4) ='^l)ES- (4 - 9) 



COS^-g-y 

Hence, I can be written as 

I = 1 - cos (I) cos (J-^j sec (^) cos (<* - <t>) > ( 4 - 10 ) 

making it evident that an interferometric method for exhibiting <!> tot would require measuring 
the shift induced by U on the intensity pattern by an angle 8 — <fr t ot (modulo it). Now, the 
expression for I as given in Eq.(4.10) mixes 5 with parameters of a representation to which it 
does not belong. By converting Eq.(4.10) to the representation f/(/3, 7, 8) we obtain 

I = 1 - cos (/3) cos {8 - <f>) . (4.11) 

The visibility v = (J max — 7 m i n ) / (l max + 7 m i n ) is thus given by v = cos/3. It is independent of 
Pancharatnam's phase. In terms of the parameters £, i], £ the visibility is given by 

v 2 (£, r], C) = ^ [1 + cos £ cos ( — cos 77 sin £ sin (] . (4-12) 
For comparison with experimental tests, it is useful to write v 2 in terms of the retarders' angles: 



v 2 (ai,a 2 ,a 3 ) = 



/37r + 4a 3 

1 + COS I I <'( is 



7r — 4ai 



cos (2ai — 4a 2 + sin 



37r + 4a3 



7r — 4ai 



Let us now refer specifically to a Mach-Zchnder interferometer. Our initial state is taken to 
be the vertically polarized state \V) that enters the first beam-splitter (sec Fig. (7.1)) along the 
X-direction. A straightforward calculation yields for the intensity measured at one of the output 
channels: 



Iv = 



cos 



cos {4>) - sin (J) cos ( ) sin (0) 



This can also be written as 



(4.13) 



Jy = - [1 — COS (/?) COS ((/> — <*>)] 



(4.14) 



The above result suggest a direct measurement of Pancharatnam's phase 8 = $tot (modulo 
7r) by recording the fringe-shift between two intcrfcrograms, one of them serving as a reference 
(8 = 0), and the other being obtained after applying the U transformation. There is however 
a practical problem with this method. An interferometric array is very sensitive to minute 
changes of its components, making it very difficult to accurately measuring 8. Different strategies 
are usually applied to overcome these shortcomings. Mechanical and thermal isolation of the 
arrangement is the most direct one, but measurements are then rather awkward. Damping 
instabilities by a feedback mechanism is another possibility; but it makes the arrangement more 
involved and difficult to operate. 

Recently, a novel strategy has been employed [23] with a Mach-Zehnder array. It was based on 
the following observation. Eq.(4.14) holds for an initial state that is vertically polarized. When 
the initial state is instead horizontally polarized, then the intensity is given by 
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(4.15) 



We observe that intensities ly and Ih are shifted with respect to each other by 25, and this 
can be exploited for measuring 5. To this end, one polarizes one half - say the upper half - 
of the laser beam vertically, and the lower half horizontally. With such a beam one feeds the 
interferometer, so that one can capture at the output an interferogram, half of which corresponds 
to ly and the other half to Ih- The upper and lower fringes will show a relative shift of 25. As 
both halves come from the same beam, they will be equally affected by whatever perturbations. 
The array is therefore insensitive to instabilities. One thus needs only to accurately measure the 
relative fringe-shift in each interferogram in order to obtain 5. Fig. (7.1) shows an sketch of the 
interferometric arrangement. By applying this method Pancharatnam's phase could be measured 
with an accuracy that is similar to the one reached by the polarimetric method. Experimental 
results are shown in Fig. (7.4) for cos 2 $tot- The versatility of the arrangement was also tested by 
measuring the visibility and comparing these results with the theoretical predictions given above 
for v 2 (ai,a 2l a 3 ) (see Fig.(7.6)). 

4.2 Polarimetric arrangement 

Wagh and Rakhccha proposed some years ago [12, 13] a polarimetric method to measure Pan- 
charatnam's phase. Such a method is experimentally more difficult to implement than the inter- 
ferometric method; but it was considered the more accurate of the two, because it requires a single 
beam. Both methods were tested in experiments performed with neutrons [14, 15], whose spins 
were subjected to SU(2) transformations through the action of a magnetic field. The conclusion 
was that polarimctry was inherently superior to interfcrometry, because of the aforementioned 
instabilities of the latter, where two beams are involved. Now, it is not obvious that one can ex- 
tract phase information from a single beam. However, as we shall see, polarimetry in the present 
case can be understood as "virtual interfcrometry" . 

Consider an initial, polarized state |i) = \+) z that is submitted to a 7r/2-rotation around 
an axis perpendicular to the polarization axis (z), e.g., a rotation around the x-axis. This pro- 
duces the state (|+) z — i \—) z ) /V2. Applying then to this state the operator exp {—<fxr z /2) we 
obtain the state V \+) z = exp (-i<jxr z /2) exp (-iir(j x /A) \+) z = (e^' 2 \+) z - ie 1 ^ 2 [-) J /y/2 = 
e -»0/2 (|+) z — ie 1 ^ \—) z ) j\f2. We have thus generated a relative phase-shift 4> between \+} z 
and |— ) . This is analogous to what is achieved in an interferometer by a relative change 
in the length of the two optical paths. We apply then a U e SU(2) and obtain the state 
UV \+) z = (er^/ 2 U\+) z -ie^l 2 U\-) z ) /y/2 = \ X+ ) + \x-) ■ From this last state we can ex- 
tract Pancharatnam's phase. To this end, we project |x+) + \x~) on the state The 
corresponding intensity is given by 



Let us write V \+) z = (e^/ 2 \+) z - ie 1 ^/ 2 \-) z ) /y/2 = \ip + ) + \tp+) and take U as given by 
7, *) of Eq.(4.4). Calculating the amplitude z (+| W (| x+ ) + | x _)) = ((<^ + | + (tp_\)(\ x+ ) + 
\X-)) we obtain, using (f±\x±) = exp (±iS) cos (/?) /2, and (<p T \x±) = i exp (t«(7 + 4>)) sin (P) / 2 , 
that ({<p+ \ + (<p-\)(\x+) + |X-)) = cos (P) cos (<^) +«sin (/?) cos (7 + <f>) and, hence, that the inten- 
sity amounts to 



A+\vH\ x+ ) + \x-)) 



(4.16) 



I = cos 2 (0) cos 2 (5) + sin 2 (/?) cos 2 (7 + 4>) . 



(4.17) 
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Eq.(4.17) contains Pancharatnam's phase 6 = $tot so that it can be extracted through inten- 
sity measurements. Indeed, we observe from Eq.(4.17) that the minimal and maximal intensities 
are given by /,„;„ = cos 2 {(3) cos 2 (5) and J max = cos 2 cos 2 (5) + sin 2 (/3), respectively, so that 

cos 2 (6) = / mi " . (4.18) 

A concrete optical arrangement requires that we implement V and U with retarders. To begin 
with, exp(-Mra x /4) = Q(f) andexp(-i^7 z /2) = Q(f )H( *f*)Q(f )■ Using Q 2 (f) = tf(f) and 
exp (+ito,/2) = Q{-l)H{^)Q{-l) we obtain 



- *uv = h (_i) n Q (- J) TO Q h (ti) H (J) . (,»> 

As for {/, it is convenient to employ the form U(£,r], Q of Eq.(4.3). This form can be directly 
implemented with retarders according to Eq.(4.7), i.e., in the form QHQ. Inserting this QHQ 
for U into Eq.(4.19) we end up requiring nine wave plates. In order to reduce this number 
as much as possible we apply relations like Q(a)H(/3) = H(/3)Q(2(3 — a), Q(a)H(/3)H(j) — 
Q(a + n/2)H(a — (i + 7 — n/2), and similar ones [21]. After this, we obtain an array of five 
retarders: 



Um __ Q (_*_fy(_^_t) Q (_*±*^_ l]x (4 , 20) 

x „(_!l±l±lzC_|) (_E_ 

Note that such an arrangement could be implemented by mounting the five plates having a 
common rotation axis, so that all the plates can be rotated simultaneously by the same angle 
0/2. The intensity to be measured is given by the following expression: 

/ = \A+\Utot\+)f= (4.21) 
= cos 2 (|) cos 2 (*±?) + [cos (|) sin (*±?) cos (0) + sin (|) sin sin (<P) ' 

From this intensity we extract Pancharatnam's phase, as given by Eq.(4.18). This theoretical 
prediction have been tested under different conditions [23]. Thus, by fixing ( to 2ir, so that 
cos 2 (6) — I m i n (1 — I max + J m in) 1 = cos 2 (f]/2) for all £, Pancharatnam's phase <& to t = »?/2. 
With ( = 2tt the arrangement that realizes the corresponding U to t reduces to 

uLT" = Q{cp)q(-\ + <^h + ^ (4 22) 

in which the rotation angle (j) has been redefined according to (— 3ir — 2(f)) /4 — > <f). Experimental 
results are shown in Fig. (7. 5). 

If instead £ = — n, it still holds that cos 2 (S) = J min (1 — 7 max + /,„;„) _1 = cos 2 (rj/2), this 
time for all (, so that <fr tot — rj/2, as before. The corresponding arrangement is now given by 

VST-Q l^^) H ( -** + <, + " - 2 * ) Q (^* ). (4.23) 
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The full arrangement of five wave-plates was also implemented as shown in Fig. (7. 2) and the 
results confirmed theoretical predictions with the expected accuracy, as also shown in Fig. (7. 5). 
Though all these experiments were performed with a cw He-Ne laser an alternative setting using 
single- photon sources should produce similar results. This is because all predictions under study 
rest on topological rather than on quantum-mechanical features. 



Chapter 5 

Geometric phase for mixed states 



The geometric phase has been introduced as a feature that involves projectors p = \ip){ip\. i.e., 
pure states. It is natural to ask whether a similar concept can be defined for mixed states. The 
answer is that it is indeed possible to do this, as was shown by Uhlmann [17], who based his 
approach on the concept of parallel transport. When \tp(s)) evolves in some parameter space 
under parallel transport it does it by remaining in-phase with its infinitcsimaly neighboring state 
\ip(s + ds)). We say that under parallel transport the system does not suffer local phase changes. 
However, after completing a closed loop a state may have acquired a nontrivial phase, which can 
be traced back to the curvature of the parameter space. This notion can be extended to systems 
being in mixed states. To this end, Uhlmann considered a so-called "purification" of the system 
being in a mixed state into a sytem being in a pure state. That is, one considers the mixed 
state as corresponding to a subsystem of a larger system which is in a pure state. There are 
infinitely many possible purifications of a given mixed state. Thus, a cyclic evolution of the latter 
induces infinitely many evolutions of the purified systems. However, one of these evolutions can 
be singled out as the one which is maximally parallel. This leads to a definition of geometric 
phases for mixed states. 

An alternative approach was addressed more recently by Sjoqvist et al. [18]. It is more 
physically motivated than Uhlmann's, but like this one it rests on connecting the mixed system 
to pure systems. The starting point is Pancharatnam's approach to the total phase. Let us 
remind that in this approach one considers the interference of two states. One of them, the 
initial state \i), is the reference state to which a phase-shift <p is applied. This state interferes 
with a state |/) = U\i), with U being a unitary operator. The interference pattern is given by 
Eq.(4.1), that we re- write here as 

I = \e^ \i) + U \i) f = 2 + 2 \(i\U\i)\ cos (<j> - arg<i|tf|i)) = 2 + 2«cos (<j> - $ tot ) , (5.1) 

with v = \{i\U\i)\ the visibility and Q t ot = arg(z|f/|z) the total phase between \i) and |/) = U\i). 

Consider now a mixed state p = with Y^ii w i = 1- The intensity profile will now 

be given by the contributions of all individual pure states that conform the mixed state p: 



I = J2 Wi K I*) + U l*)| 2 = 2 + 2 J2 Wi KW>I c°s (<f> - wg(i\U\i)) . 



(5.2) 



We can write I in a basis-independent form as 



/ = 2 + 2 | TV (Up) | cos [<t> - arg Tr (Up)} . 



(5.3) 
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It is then clear that the visibility is now given by v = \Tr (Up)\ and that the total phase can 
be operationally defined as <& tot = arg Tr(Up), which is the value of the shift (f> at which the 
maximal intensity is attained. Of course, such a definition reduces to Pancharatnam's original 
definition for a pure state p = 

The above definition assumes a unitary evolution |z) — > \ f) = U\i). A non-unitary evolution 
can be handled by modeling it with the help of an environment being in a pure state |0 e )(0 e | 
that is appended to the given mixed state. The system plus environment is then described by 
p = p® |0 e )(0 e | and evolves unitarily p — > p' = UpW in such a way that by tracing over the 
environment we recover the given change of the mixed state: p — ► p' = Tr e p'. Introducing an 
orthonormal basis {|fc e )}fc=o,...,M f° r the environment, we can write Tr e ]7 = £\ K^pKt, with 
Kk = (k e \U\Q e ) being the so-called Kraus operators. The map p — > p' is thus a completely 
positive map that takes density operators into density operators. 

Let us turn back to unitary evolutions. We have extended Pancharatnam's phase to unitarily 
evolving mixed states. Now let us address a similar extension of the geometric phase. The unitary 
evolution we have in sight is of the form p(s) = U(s) p W(s). For pure states \ip(s)) the geometric 
phase was equal to the total phase for a state undergoing parallel evolution: (tp(s)\ip(s)) = 
0. In that case the dynamical phase vanishes. We should then try to extend the notion of 
parallel transport for mixed states. For such an evolution the state p(s) should be in phase 
with the state infinitesimally near to it, p(s + ds) = U(s + ds) p W(s + ds) = U(s + ds) 
U\s)p(s)U(s)U^(s + ds). According to our previous definition, the phase difference between p(s) 
and p(s + ds) is given by arg Tr (U(s + ds)W (s)p(s)) in this case. For p(s) and p(s + ds) to 
be in phase then, arg Tr (U(s + ds)W(s)p(s)) should vanish. However, because Tr (p(s)) = 1 

and p(s)^ = p(s), the number Tr (ifW p^j is purely imaginary. Hence, a necessary condition for 
parallel transport is 

Tr (U(s)tf(s)p(s)^j =0 (5.4) 

However, such a condition is not sufficient to fix U(s) for a given p(s). Indeed, considering 
any N x N matrix representation of the given p, Eq.(5.4) determines the N x N matrix U only 
up to N phase factors. In order to fix these factor we must impose a more stringent condition: 

(k(s)\U(s)U^(s)\k(s)) = 0, k = l,...,N, (5.5) 

where p(s) — J2 k Wk\k(s))(k(s)\. This gives the desired generalization of parallel transport to the 
case of mixed states. Eq.(5.4) reduces to (i/)(s)\ip(s)) — for a pure state p(s) = \ip(s))(tp(s)\. 
Indeed, Eq.(5.4) gives Tr (ll (s)W (s) p(s)} = Tr (u^(s)U(s)p(0)^ = 0, so that for p(0) = 

|V(0))(V>(0)| we have Tr (£/T(s)£7(s)|^(0))(^(0)|) - (ij(0)\W (s)U(s)\ip(0)) = (tp(s)\ip(s)) = 0. 

We can now define a geometric phase for a state tracing out the curve defined by the map 
C : s — > p(s) = U(s)p U J< (s), with s £ [si,s 2 ] and U(s) satisfying Eqs.(5.4) and (5.5). The 

dynamical phase &dyn = —i dsTr (u^ (s)U(s)p(0) S j then vanishes and we define the geometric 
phase <i>g for mixed states as 

$ s = argTr(C/( S )p(0)). (5.6) 

Considering the ensemble decomposition p(0) = ^2 k Wk\k(0))(k(0)\ a look at Eq.(5.2) makes 
E> g in the form 

$ 9 - a,r g Tr(U(s)p(0)) - arg ( ^ TO e^ j , (5.7) 



clear that we can write <& g in the form 
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with the geometric phase factors e m being associated with the individual pure-state paths fol- 
lowed by the members of the ensemble. 

<&g is manifestly gauge and parametrization invariant and has been defined for open paths, 
in general. However it does not share all the properties of the geometric phase for pure states. 
In particular, it is not additive in the sense of Eq.(3.28). Nevertheless, in special cases it can be 
connected with a solid angle. Let us see an example of such a case. 

Consider a two-level system. In this case 

p = = \{I + rn-^), (5.8) 

with ~n ■ ~n = 1 and r being constant for a unitary evolution U. For pure states r = 1, while 
r < 1 for mixed states. The unitary evolution of p(s) makes ~n(s) to trace out a curve on 
the Bloch sphere. Suppose this curve is closed by joining its initial and final points with a 
geodesic, so that it subtends a solid angle ft. Then, the two eigenstates |±;"n • ~a) of ~n ■ ~~o 
acquire the geometric phases as we saw before. Both states have the same visibility 

Vq = |(±; ~rt ■ ~~a |f/|±; it ■ ~~a )|. From Eq.(5.7) we obtain, considering that in the present case the 
Wk=± (the eigenvalues of p) are given by w± — (1 ± r)/2, the geometric phase 



arg 



1 



iCl/2 



1 



,+iQ/2 



2 2 
and the visibility v = \Tr (Up(0))\, that is 



arg 



cos 



ir sin 



arctan r tan 



m 

(5.9) 
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+tCl/2 



Vq \ / COS^ 



9 • 2 

r z sin 



(5.10) 



Eqs.(5.9) and (5.10) reduce for r = 1 to $ g = — £1/2 and w = v , respectively, the known 
expressions for pure states. For maximally mixed states, i.e., r = 0, we obtain $ g = arg cos (£1/2) 
and v = |cos (0/2) | , so that for the output intensity Eq.(5.3) gives 



I = 2 + 2 |cos (ft/2) | cos (</> - arg cos (ft/2)) = 2 + 2 cos (ft/2) cos (f>. 



(5.11) 



We see that for ft = 2ir there is a sign-change in the intensity pattern. This was experimentally 
observed in early experiments [16] testing the 47r symmetry of spin- 1/2 particles. 
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Chapter 6 



Off-diagonal geometric phases and 
geometric phase between 
orthogonal states 



As we have seen, in the context Berry's original approach, a system undergoing adiabatic evo- 
lution under the action of a parameter-dependent Hamiltonian H(s) accumulates a geometric 
phase. The instantaneous eigenstates of the Hamiltonian were denoted by \n;R(s)}, that is, 
H(R(s)) |n; R(s)) = E n (R(s)) \n; R(s)), with R meaning the set of parameters and the eigenval- 
ues E n being assumed as non-degenerate. Let simplify this notation, by writing H(s) \u n (s)) = 
E n (s) \u n (s)). When \u n (s)) is transported adiabatically along a closed loop R(s2) = R(si), the 
scalar product (u n (si)\u n (s 2 )) acquires a nontrivial geometric phase factor. It is assumed that 
(u n (si)\u n (s 2 )) 7^ 0; otherwise, the relative phase is not well defined. However, it might occur, 
for (i^ to, that \u n (s 2 )) = e la \u m (si)). In such a case both (u„(s 1 )|u„(s 2 )) and (u m (si)\u m (s 2 )) 
vanish, as E n (si) and E m (s 2 ) are non-degenrate, so that the Pancharatnam-Berry phase is un- 
defined. If we are looking for phase information, then we should be able to extract it from the 
off-diagonal matrix element {u n (si)\u m (s 2 )) . This was done by Manini and Pistolesi some years 
ago [7]. 

In the usual definition of the geometric phase one considers the parallel transport of a nor- 
malized state \u n {s)) along a curve C with s e [si, s 2 }. A state |ui!(s)) that is parallel transported 
satisfies (wn(s)|u!l(s)) = 0. It represents a particular lift of \u n (s))(u n (s)\, whose phase has been 
so chosen, that (un(s)\un(s + Ss)) differs from unity up to terms of second order in Ss. This means 
that the phase of \un(s + Ss)) is such that this state has maximal projection on the "previous" 
state |u|[(s)). We can make this phase-choice explicit, by writing 



KO2)) = exp 



/S2 
(u n (s)\ii n 
1 



(s))ds |u„(s 2 )) = exp - j (u n (R)\V\u n (R)) ■ dR \u n (s 2 )). 

(6.1) 

The geometric phase is then given by arg(u[i(si)|«H(s 2 )) or, equivalently, by the following phase 
factor that is defined in terms of the function $ (z) — z/\z\, for complex z^O: 

^ EE $ (C/f„) = $ (<«JI (*l)|uJ!(* 2 )>) • ( 6 ' 2 ) 

As we have seen, 7„ is gauge and parametrization invariant and is thus a measurable quantity. 
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The natural extension of 7^ appears to be the quantity o c nm = <I> (U% m ) = $ ^(ull(si)|w|ij(s2))J ; 

but such a quantity happens to be gauge dependent. Under gauge transformations \u n (s)) — ► 
exp(ia„(s)) \u n (s)), it transforms as 

a n m ^nm ex P (w*m(s) ~ »n(s)) . (6.3) 

Manini and Pistolesi [7] defined thus a measurable quantity related to cr^ m , that is the proper 
generalization of 7^, namely 

7nm ^nm^mni (6-4) 

and gave to it a simply geometric interpretation analogous to that of Pancharatnam's phase [7] . 
The simplest system on which off-diagonal phases could be tested is a spin-1/2 acted upon by a 
slowly rotating magnetic field whose orientation is so chosen that 7^L 1 . 2 are undefined and all 
the phase information is thus contained in 7^. The theoretical predictions for this case have 
been experimentally confirmed [22] . 

The goal of circumventing the restriction of dealing with non-orthogonal states when defining 
geometric phases has been led a step further by Wong, Cheng and Chu [6]. These authors 
introduced a "projective phase" between two states \tp(0)} and \ip(s)}, given by 



^(0, S )=arg(V(0)|i)(#( S )}, (6.5) 

with |i) being some state to which neither IV'(O)) nor \ip(s)) are orthogonal, though they may 
be orthogonal to each other. The physical idea behind the definition of the projective phase 
is similar to the idea behind Pancharatnam' s phase. In the latter case two states are taken 
to interfere with one another and then the amplitude (tp(s)\ip(0)} contains the phase difference. 
This goes as long as such an amplitude does not vanish. If it does, we can first project each of the 
states along a state \i), which is non-orthogonal to them, and take their projections, \i)(i\tjj(Q)) 
and \i)(i\ip(s)) to interfere. This leads to tfi(0, s). 

Wong, Cheng and Chu proved [6] that the projective phases at two states, \i) and \j), are 
related to one another by the following gauge transformation: 

exp[%(0,s)] = S ij (V(0))exp[^(0,s)]^(V(s)), (6.6) 
with the transition functions being defined as 

The projective phase can be used to prove that the Pancharatnam phase-change that arises 
when passing through an orthogonal state is either ir or zero (modulo 2ir). Furthermore, all off- 
diagonal phases can be obtained from the projective phase. For details, we refer to the literature 
[6]. 



Chapter 7 



Thomas rotation in relativity and 
in polarization optics 

Let us address a well known effect of relativity and see how it can be understood in terms of 
notions related to geometric phases. We recall that Thomas rotation arose as a surprising effect 
that passed unnoticed for several years, even to Einstein himself [24]. It is a consequence of 
the structure of the Lorentz group of transformations. These transformations relate to each 
other two incrtial reference systems, O and O 1 , whose respective coordinates we denote by 
and x'^ with /i, as all Greek indexes in this Section, running from to 3; x° — ct, and the x l 
denoting spatial coordinates (Latin indexes run from 1 to 3). Both reference systems, O and O' , 
are related by a Lorentz transformation, which is defined as one that leaves the quadratic form 
(.x ) 2 — (x 1 ) 2 — (x 2 ) 2 — (x 3 ) 2 invariant. Let us write this form with the help of the metric tensor 
Vnv = diag(l, — 1, — 1, —1) as rj^ v x^x v , where we have used the convention that a sum is assumed 
over repeated indexes. A Lorentz transformation, x^ — > x'^ = k^x v is given by those matrices A 
whose entries satisfy 

a^ t a; = »? w . (7.i) 

Lorentz transformations form a six-parameter Lie group, whose elements can be written as 
A = cxp L, with 

L= -w • ~S - ~C-K. (7.2) 

Here, S and K arc the group generators, while To = (lui,lu 2 ,uj 3 ) and £ ~ (Ci 3 C2 , C3) denote 
the six parameters in terms of which each group element is defined. The generators form an 
algebra, the Lie algebra of the group, which in the present case is defined through the following 
commutators: 



[Si,Sj] — tijkSk, (7.3) 
[Si,Kj] - e ijk K k , (7.4) 
[Ki,Kj] = -e ijk Sk (7.5) 

In Eq.(7.3) we recognize the commutators of the su(2) algebra, which is the algebra of the 
rotation group. Rotations are thus particular Lorentz transformations. On the other hand, the 
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Ki are generators of so-called "boosts" , those transformations which relate coordinate systems 
moving uniformly with respect to each other, and with their coordinate axes being parallel, i.e., 
~ex || ~e x', etc. Intuitively, if O and O' are related by a boost, and so also O' and O" , then we 
expect that the same holds true for the transformation relating O and O" . That is, if ~e x || ~e x i 
and e x > \\ e x n, then we expect that e x \\ e x n. The surprising discovery of Thomas was that it 
is not the case. Being parallel is not a transitive property, in the frame of Lorentz transformations. 
Technically speaking, the product of two boosts is not a boost, but some Lorentz transformation 
that can be expressed as a product of a boost by a rotation, the Thomas rotation. 

As almost all relativistic effects, in order to exhibit Thomas rotation we should consider two 
systems whose relative velocity is near the velocity of light: c. Otherwise, the effect is too small 
to be observed. However, there is an equivalent effect that appears in the context of geometric 
phases whose observation might be realizable with standard equipment. The root of Thomas 
rotation is the non-transitive property of boosts. As we have seen, Pancharatnam's connection 
relates also in a non-transitive way two polarization states. Intensity-preserving transformations 
of these states form a representation of the rotation group SU{2). But these are only particular 
transformations among more general ones which include intensity non-preserving transformations. 
The latter can be realized with the help of, e.g., polarizers, that is, dichroich optical elements. 
As we shall see, these elements provide us with the necessary tools for studying Thomas rotations 
in an optical framework. 

Before we discuss the optical framework, we need some more algebra to build the bridge 
connecting Lorentz transformations with polarization transformations. To this end, we recall a 
Lorentz invariant, quantum mechanical equation, the Dirac equation: 



(iVfy - m)i>{x) = 0, (7.6) 

with i){x) denoting a bi-spinor - a complex-valued four-component function of space-time x^ - 
and the 7 M being the Dirac matrices whose defining algebra is 



2rr 



(7.7) 



The vector space to which the bi-spinors ip(x) belong provides a representation-space for 
Lorentz transformations, just as space-time with vectors £ M does. Whereas in the latter Lorentz 
transformations are represented by matrices A = exp L, in bi-spinor space they are represented 
by matrices of the form 

S(n^) = exp (-^^y (7.8) 

with the antisymmetric coefficients fi M ^(A) = — f2„ M (A) representing the six independent param- 
eters that define each element of the group, just as the parameters in A = A(~lj , £ ) do. 

The commutation properties of the 7 M matrices allow us to write Sfilpv) in terms of Pauli 
matrices a . This is so because 5(12^) contains only even products of the 7 M matrices. Such 
products conform a subalgcbra of the (Clifford) algebra formed by the 7 M . This subalgebra 
is isomorphic to the Pauli-algebra (sometimes employed in polarization optics as quaternion 
algebra). We can then map each 4x4 matrix S^f^) into a 2 x 2 matrix T(~a, (3) given by 



T(~a, (3) = exp 



(a + i/) 



We see that T(~a, (3) is like an element of SU(2), exp 



i 5 ■ a 



(7.9) 



but having three complex 



coefficients (ix + i (3 in place of the three real ones S . The complex coefficients (ix + i (3 
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entail the six real parameters of the Lorcntz group. The representation of this group by matrices 
of the form given in Eq.(7.9) is what we need to make the connection with polarization optics, 
where matrices of such a kind represent polarization transformations. 

Polarization states corresponding to a monochromatic plane wave of light can be repre- 
sented by two-component vectors with complex components: (y x exp (i<f) x ) , v y exp (i(f) y )) T , so- 
called Jones vectors. Physically, only the relative amplitude v x /v y and the relative phase <p y — <f> x 
matter, so that a polarization state can be given in the form: 

Alternatively, polarization states can be represented by four-component Stokes vectors (sq, ~s), 
corresponding to a quantum-mechanical representation of pure states by density operators: 

p=|7r><7r| = i (/ + !>■-?). (7.11) 

Here, as we have taken a normalized ket \n) to represent the polarization state, So = Tr(\ir) (ir\) = 
1. This is the coefficient of the identity matrix I in Eq. (7.11). In general, the Stokes four- 
vector (s ,~s) = (Trp,Tr(p ■ ai),Tr(p ■ cr 2 ),Tr(p ■ cr 3 )). For a \ir) given as in Eq.(7.10), ~s = 
(cos(0) sin(2x), sin(0) sin(2%), cos(2x))- The set of unit vectors ~s span the Poincare-Bloch sphere 
and constitute a representation of polarization states (the standard Poincare sphere in optics 
corresponds to choosing a representation of Pauli matrices p given by p\ = 03, p 2 = <j\, 
A3 = 02). 

Intensity preserving transformations, like those realized by wave plates - examples of these 
are quarter-wave plates (QWP) and half-wave plates (HWP) - are represented by 2 x 2 matrices 
belonging to the SU (2) group. The effect of such a matrix on s is to rotate it without changing 
its length. A general matrix of the group SU(2) is given by exp (i5 n ■ p /2). Matrices Q(5) and 
H(S) representing a QWP and a HWP, respectively, are given by 

Q(S) = ^— [7-isin(25)<7i - icos(2<S)cr 3 ] (7.12) 
v2 

H(S) = -ism(26)a 1 -icos{26)a 3 . (7.13) 

By applying U = exp (i$~n ■ ~a /2) = cos ( < i ) /2) I + ism ( < I ) /2) n ■ ~~o to an initial polarization 
vector |7Tj) we obtain a transformed output vector \ir a ) = U\iTi). The corresponding Stokes 
vectors, ~Si andl? respectively, are related by the well-known Rodrigues formula [25] that gives 
a rotated vector in terms of the rotation angle and axis: 

~s a = cos(<i>)Vj + [1 - cos(<f>)] {~n ■ ~~Si) ~n + sin(<i>)V i x it. (7-14) 
This equation can be derived from 



\n )(ir \ = ^(I+^ -?) = U\n i )(ir i \tf (7.15) 

° OS (f ) / + * sin (f ) \ + ( cos (f ) /_ * sin (f 

Consider now dichroic optical elements like a non-ideal polarizer. To encompass optical con- 
ventions [26] let us use in what follows the ~p Pauli matrices: pi = 0-3, pi = ox, p% = (Ji- In such 
a representation the polarization vector as given in Eq.(7.10) corresponds to being x-polarized 
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when x = and y-polarized when x = 7r/2. In this case the matrix representing a non-ideal po- 
larizer whose lines of maximal and minimal transmission coincide with the x- and y-polarization 
axes, respectively, is given by 

Jdia 9 = ( ^ „° V (7.16) 



P y 

The eigenvectors of Jdiag, (1>0) T and (0,1) T , are thus polarization vectors along the x 
and —x directions, respectively, on the Poincare sphere. In order to obtain the correspond- 
ing matrix whose eigenvectors are \tti) = (cos x, e 1 ^ sin x) and its orthogonal vector ^2) = 
(—e _J<i6 sinx, cos x) T , we submit Jdiag to the unitary transformation 

J=UJ diag U\ (7.17) 
with U being the matrix whose columns are |7Ti) and \tt 2 ): 

U=( COSX -e _i0 sinx \ (718 x 
\_ e'^sinx cosx / 

This gives for J a matrix that can be written as 

J= (^±Py^j I+ {^^j [(cos2x)/9i + (sin2xcos0)p 2 + (sin2xsin^)p 3 ]. (7.19) 

Assuming that the transmission axis is x, i.e., that p x > p y , we write p x = cxp(— a<), 
p y = cxp(— a>) and set T = (cos2x,sin2xcos</>,sin2xsin(/>). Then, 

J = cx P (-^) {-h (^^) / + sinh (^< ) F • 7?} . (7.20) 

Now, Eq.(7.9) can be shown to be just of this form. Indeed, in order to show this we write 
T(~a , (3 ) = cxp(— / • ~p), with / = ~a + i (3 . The matrix / • ~p has the following (in general 
complex) eigenvalues: 



A± = ±^~a 2 + 2i~a -7?) = ±z. (7.21) 

Let us denote by |f±) the eigenvectors of / • ~~p ; that is, ^ / • ~p^j |f±) = A± |f±). Thus, we 
have 



/ = |f + )(f + | + |f_)(f_| (7.22) 
7-7? = A + |f + )(f+| + A_|f_)(f_|. (7.23) 

Solving for |f±) (f±| it follows that 

\f±){f±\ = Zl± l' y (7.24) 

We apply now the general decomposition cxp A — ^ n exp a n \a n ) (a n | to the present case, i.e., 
A = — f -~p . The operator cxp ^— / • ~p ) has eigenvectors |f±) and eigenvalues exp (t z )- Thus, 



39 



exp(-7-7) = e-|f + )(f + | + e z |f_)(f_| = ^(^ + f-7) + ^(^-7-7) 

= (^^)/-(^^)7-7? = coshz-sinhz^|^.7? (7.25) 

The Lorcntz transformation that exp(— / -7?) represents can generally be written as a product 
of a boost by a rotation. It is clear from Eq.(7.9) that a rotation is obtained when a = and 
a boost when (3 = . As we have seen, a general rotation U € SU{2) can be implemented 
with the help of three wave-plates as Q({33)H(@2)Q(Pi), see Eq.(4.6). A general boost can be 
implemented with dichroic elements that realize the transformation given in Eq.(7.20). The 
global factor there, exp (— (a> + a<) /2), corresponds to an overall diminution of intensity only. 
We can thus realize any exp(— / • ~p) by using optical elements like wave-plates and dichroic 
elements. In particular, by letting a polarization state pass through two consecutive dichroic 
elements - each corresponding to a boost - we could make appear a phase between initial and 
final polarization states. This is a geometric phase whose origin is a rotation induced between 
the polarization states: a Thomas rotation. Such an effect can thus be easily exhibited in the 
domain of polarization optics, in contrast with the relativistic case, where one needs to reach 
velocities near the velocity of light. Moreover, we have here another nice example showing the 
common topological root of two phenomena, that at first sight might appear to be unrelated to 
one another. 
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7.1 Intcrfcrometric arrangement for testing Pancharatnam's phase &tot- Light from 
a He-Ne laser (L) passes a polarizer (P) and enters a beam expander (E), after 
which half of the beam goes through one polarizer (Pi) and the other half goes 
through a second polarizer (P2), orthogonally oriented with respect to the first. 
The two collinear beams feed the same Mach-Zchndcr interferometer (BS: beam- 
splitter, M: mirror), in one of whose arms an array of three retarders has been 
mounted (Q: quarter-wave plate, H: half-wave plate), so as to realize any desired 
SU(2) transformation. This transformation introduces a Pancharatnam's phase 
&tot = 6 on one half of the beam and an opposite phase <I> tot = —5 on the other, 
perpendicularly polarized half, so that the relative phase of the two halves equals 
26. From the relative shift between the upper and lower halves of the intcrfcrogram 
that is captured by a CCD camera set at the output of the array one can determine 
$ tot . Any instability of the array affects both halves of the interferogram in the 
same way, so that the relative shift 26 is insensitive to instabilities 

7.2 Polarimctric arrangement for testing Pancharatnam's phase Qp. With an array of 
five retarders (Q:quarter-wave plate, H: half-wave plate) and two polarizers (P) 
a relative phase <f> between two polarization components |±) z can be introduced, 
on which any desired SU(2) transformation can be applied. The five retarders 
are simultaneously rotated, thereby varying </>, and the intensity I ((f) is recorded. 
From the maximum and minimum values of I one can determine <&p, according to 
cos 2 ($ tot ) = 7 min /(l - 7 max + J min ) 

7.3 Pancharatnam's phase can be extracted from the relative fringe-shift between the 
upper and lower parts of the interferogram. The relative shift equals twice the 
Pancharatnam's phase. The left panels show the result of performing a column 
average of the fringes plus the application of a Savitzky-Golay filter to get rid of 
noise features. The column average is performed after selecting the evaluation area 
Rq on the interferogram, as illustrated on the right panel. The reported shifts are 
mean values obtained from four different selections, Rq, . . . , R3, of the evaluation 
area 

7.4 Experimental results from the interfcromctric measurement of Pancharatnam's 
phase. We plot cos 2 (<I> tot ) as a function of £ and 77, with ( being held fixed to zero. 
In the upper panels we plot the single curves that arc highlighted on the surface 
shown on the lower panel. Dots correspond to experimental values, some of which 
fall below and some above the surface 
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7.5 Experimental results from a polarimetric measurement of Pancharatnam's phase. 
The upper graphs correspond to an array that consists of three retarders set in 
the form QQH (left) and QHQ (right). Parameter values are as indicated and 
cos 2 (<I> tot ) was measured as a function of ij. The lower curve corresponds to the 

full array of five retarders set in the form QQQHQ 49 

7.6 Interfcromctric measurement of the visibility v{6\, 82, 63). The left panels show 
the surfaces obtained by fixing one of the three angles, 61, as indicated. The right 
panels show the experimental results that correspond to the curves highlighted on 
the surfaces. The upper curve is obtained by fixing 6*3 besides 9\, the lower curve 
by fixing 9 2 and 9\. In the upper curve all experimental values fall below the 
predicted (maximal) visibility of 1. This is because I m % n is never zero, as required 
to obtain v = 1. By subtracting the nonzero average of I min the experimental 
points would fall above and below the theoretical curve, as it occurs for the lower 
curve, which corresponds to v < 1 50 



Figures are taken from Ref. [23] 
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Figure 7.1: Interferometric arrangement for testing Pancharatnam's phase <&tot- Light from a 
He-Ne laser (L) passes a polarizer (P) and enters a beam expander (E), after which half of the 
beam goes through one polarizer (Pi) and the other half goes through a second polarizer (P2), 
orthogonally oriented with respect to the first. The two collinear beams feed the same Mach- 
Zchnder interferometer (BS: beam-splitter, M: mirror), in one of whose arms an array of three 
retarders has been mounted (Q: quarter- wave plate, H: half- wave plate), so as to realize any 
desired SU(2) transformation. This transformation introduces a Pancharatnam's phase $ tot = S 
on one half of the beam and an opposite phase <&tot = — 8 on the other, perpendicularly polarized 
half, so that the relative phase of the two halves equals 25. From the relative shift between 
the upper and lower halves of the interferogram that is captured by a CCD camera set at the 
output of the array one can determine $>tot- Any instability of the array affects both halves of 
the interferogram in the same way, so that the relative shift 25 is insensitive to instabilities. 
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Figure 7.2: Polarimetric arrangement for testing Pancharatnam's phase $p. With an array of 
five retarders (Q: quarter-wave plate, H: half-wave plate) and two polarizers (P) a relative phase 
4> between two polarization components |±} z can be introduced, on which any desired SU(2) 
transformation can be applied. The five retarders are simultaneously rotated, thereby varying 
(j>, and the intensity I((f>) is recorded. From the maximum and minimum values of / one can 
determine <J> P , according to cos 2 (4> tot ) = I min /(1 - /max + imin)- 
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Figure 7.3: Pancharatnam's phase can be extracted from the relative fringe-shift between the 
upper and lower parts of the interferogram. The relative shift equals twice the Pancharatnam's 
phase. The left panels show the result of performing a column average of the fringes plus the 
application of a Savitzky-Golay filter to get rid of noise features. The column average is performed 
after selecting the evaluation area Rq on the interferogram, as illustrated on the right panel. 
The reported shifts are mean values obtained from four different selections, Rq, . ■ ■ ,i?3, of the 
evaluation area. 
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Figure 7.4: Experimental results from the interferometric measurement of Pancharatnam's phase. 
We plot cos 2 (<i> tot ) as a function of £ and 77, with £ being held fixed to zero. In the upper panels 
we plot the single curves that are highlighted on the surface shown on the lower panel. Dots 
correspond to experimental values, some of which fall below and some above the surface. 
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Figure 7.5: Experimental results from a polarimetric measurement of Pancharatnam's phase. 
The upper graphs correspond to an array that consists of three retarders set in the form QQH 
(left) and QHQ (right). Parameter values are as indicated and cos 2 (<l> tot ) was measured as a 
function of rj. The lower curve corresponds to the full array of five retarders set in the form 
QQQHQ. 
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Figure 7.6: Interferometric measurement of the visibility v{6\, 62, O3). The left panels show the 
surfaces obtained by fixing one of the three angles, 9\, as indicated. The right panels show the 
experimental results that correspond to the curves highlighted on the surfaces. The upper curve 
is obtained by fixing #3 besides 6±, the lower curve by fixing #2 and B\. In the upper curve 
all experimental values fall below the predicted (maximal) visibility of 1. This is because I m i„ 
is never zero, as required to obtain v = 1. By subtracting the nonzero average of I m in the 
experimental points would fall above and below the theoretical curve, as it occurs for the lower 
curve, which corresponds to v < 1. 



